31 research outputs found

    Invertible Zero-Shot Recognition Flows

    Get PDF
    © 2020, Springer Nature Switzerland AG. Deep generative models have been successfully applied to Zero-Shot Learning (ZSL) recently. However, the underlying drawbacks of GANs and VAEs (e.g., the hardness of training with ZSL-oriented regularizers and the limited generation quality) hinder the existing generative ZSL models from fully bypassing the seen-unseen bias. To tackle the above limitations, for the first time, this work incorporates a new family of generative models (i.e., flow-based models) into ZSL. The proposed Invertible Zero-shot Flow (IZF) learns factorized data embeddings (i.e., the semantic factors and the non-semantic ones) with the forward pass of an invertible flow network, while the reverse pass generates data samples. This procedure theoretically extends conventional generative flows to a factorized conditional scheme. To explicitly solve the bias problem, our model enlarges the seen-unseen distributional discrepancy based on a negative sample-based distance measurement. Notably, IZF works flexibly with either a naive Bayesian classifier or a held-out trainable one for zero-shot recognition. Experiments on widely-adopted ZSL benchmarks demonstrate the significant performance gain of IZF over existing methods, in both classic and generalized settings

    Bayesian Zero-Shot Learning

    Get PDF
    Object classes that surround us have a natural tendency to emerge at varying levels of abstraction. We propose a Bayesian approach to zero-shot learning (ZSL) that introduces the notion of meta-classes and implements a Bayesian hierarchy around these classes to effectively blend data likelihood with local and global priors. Local priors driven by data from seen classes, i.e., classes available at training time, become instrumental in recovering unseen classes, i.e., classes that are missing at training time, in a generalized ZSL (GZSL) setting. Hyperparameters of the Bayesian model offer a convenient way to optimize the trade-off between seen and unseen class accuracy. We conduct experiments on seven benchmark datasets, including a large scale ImageNet and show that our model produces promising results in the challenging GZSL setting

    Report on the BTAS 2016 Video Person Recognition Evaluation

    Full text link
    © 2016 IEEE. This report presents results from the Video Person Recognition Evaluation held in conjunction with the 8th IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS). Two experiments required algorithms to recognize people in videos from the Point-and-Shoot Face Recognition Challenge Problem (PaSC). The first consisted of videos from a tripod mounted high quality video camera. The second contained videos acquired from 5 different handheld video cameras. There were 1,401 videos in each experiment of 265 subjects. The subjects, the scenes, and the actions carried out by the people are the same in both experiments. An additional experiment required algorithms to recognize people in videos from the Video Database of Moving Faces and People (VDMFP). There were 958 videos in this experiment of 297 subjects. Four groups from around the world participated in the evaluation. The top verification rate for PaSC from this evaluation is 0.98 at a false accept rate of 0.01 - a remarkable advancement in performance from the competition held at FG 2015

    LifeCLEF 2016: Multimedia Life Species Identification Challenges

    Get PDF
    International audienceUsing multimedia identification tools is considered as one of the most promising solutions to help bridge the taxonomic gap and build accurate knowledge of the identity, the geographic distribution and the evolution of living species. Large and structured communities of nature observers (e.g., iSpot, Xeno-canto, Tela Botanica, etc.) as well as big monitoring equipment have actually started to produce outstanding collections of multimedia records. Unfortunately, the performance of the state-of-the-art analysis techniques on such data is still not well understood and is far from reaching real world requirements. The LifeCLEF lab proposes to evaluate these challenges around 3 tasks related to multimedia information retrieval and fine-grained classification problems in 3 domains. Each task is based on large volumes of real-world data and the measured challenges are defined in collaboration with biologists and environmental stakeholders to reflect realistic usage scenarios. For each task, we report the methodology, the data sets as well as the results and the main outcom

    Does advancing male age influence the expression levels and localisation patterns of phospholipase C zeta (PLCζ) in human sperm?

    Get PDF
    Socio-economic factors have led to an increasing trend for couples to delay parenthood. However, advancing age exerts detrimental effects upon gametes which can have serious consequences upon embryo viability. While such effects are well documented for the oocyte, relatively little is known with regard to the sperm. One fundamental role of sperm is to activate the oocyte at fertilisation, a process initiated by phospholipase C zeta (PLCζ), a sperm-specific protein. While PLCζ deficiency can lead to oocyte activation deficiency and infertility, it is currently unknown whether the expression or function of PLCζ is compromised by advancing male age. Here, we evaluate sperm motility and the proportion of sperm expressing PLCζ in 71 males (22–54 years; 44 fertile controls and 27 infertile patients), along with total levels and localisation patterns of PLCζ within the sperm head. Three different statistical approaches were deployed with male age considered both as a categorical and a continuous factor. While progressive motility was negatively correlated with male age, all three statistical models concurred that no PLCζ–related parameter was associated with male age, suggesting that advancing male age is unlikely to cause problems in terms of the sperm’s fundamental ability to activate an oocyt

    P-ODN: Prototype-based Open Deep Network for Open Set Recognition

    No full text

    Large-scale learning for media understanding

    No full text

    Meta-Recognition: The Theory and Practice of Recognition Score Analysis

    No full text
    Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)In this paper, we define meta-recognition, a performance prediction method for recognition algorithms, and examine the theoretical basis for its postrecognition score analysis form through the use of the statistical extreme value theory (EVT). The ability to predict the performance of a recognition system based on its outputs for each match instance is desirable for a number of important reasons, including automatic threshold selection for determining matches and nonmatches, and automatic algorithm selection or weighting for multi-algorithm fusion. The emerging body of literature on postrecognition score analysis has been largely constrained to biometrics, where the analysis has been shown to successfully complement or replace image quality metrics as a predictor. We develop a new statistical predictor based upon the Weibull distribution, which produces accurate results on a per instance recognition basis across different recognition problems. Experimental results are provided for two different face recognition algorithms, a fingerprint recognition algorithm, a SIFT-based object recognition system, and a content-based image retrieval system.33816891695ONR [N00014-07-M-0421, N00014-09-M-0448]US National Science Foundation (NSF) [065025]Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)ONR [N00014-07-M-0421, N00014-09-M-0448]US National Science Foundation (NSF) [065025]FAPESP [2010/05647-4

    Toward Open Set Recognition

    No full text
    Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)To date, almost all experimental evaluations of machine learning-based recognition algorithms in computer vision have taken the form of 'closed set'recognition, whereby all testing classes are known at training time. A more realistic scenario for vision applications is 'open set'recognition, where incomplete knowledge of the world is present at training time, and unknown classes can be submitted to an algorithm during testing. This paper explores the nature of open set recognition and formalizes its definition as a constrained minimization problem. The open set recognition problem is not well addressed by existing algorithms because it requires strong generalization. As a step toward a solution, we introduce a novel '1-vs-set machine,'which sculpts a decision space from the marginal distances of a 1-class or binary SVM with a linear kernel. This methodology applies to several different applications in computer vision where open set recognition is a challenging problem, including object recognition and face verification. We consider both in this work, with large scale cross-dataset experiments performed over the Caltech 256 and ImageNet sets, as well as face matching experiments performed over the Labeled Faces in the Wild set. The experiments highlight the effectiveness of machines adapted for open set evaluation compared to existing 1-class and binary SVMs for the same tasks.35717571772ONR MURI [N00014-08-1-0638]Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Army SBIR [W15P7T-12-C-A210]MicrosoftFundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)ONR MURI [N00014-08-1-0638]FAPESP [2010/05647-4]Army SBIR [W15P7T-12-C-A210
    corecore